Myhill-Nerode Theorem for Sequential Transducers over Unique GCD-Monoids
نویسنده
چکیده
We generalize the classical Myhill-Nerode theorem for finite automata to the setting of sequential transducers over unique GCDmonoids, which are cancellative monoids in which every two non-zero elements admit a unique greatest common (left) divisor. We prove that a given formal power series is sequential, if and only if it is directed and our Myhill-Nerode equivalence relation has finite index. As in the classical case, our Myhill-Nerode equivalence relation also admits the construction of a minimal (with respect to the number of states) sequential transducer recognizing the given formal power series. Deterministic finite automata and sequential transducers are applied, for example, in lexical analysis, digital image manipulation, and speech processing [2]. In the latter application area also very large sequential transducers, i.e., transducers having several million states, over various monoids are encountered [2], so without minimization algorithms [4] the applicability of sequential transducers would be severely hampered. In [2, 3] efficient algorithms for the minimization of sequential transducers are presented in case the weight is taken out of the monoid (∆∗, ·, ε) or out of the monoid (IR+,+, 0). A Myhill-Nerode theorem also allowing minimization is well-known for sequential transducers over groups [1]. We use (A, ,1,0) to denote a monoid with the absorbing element 0. A unique GCD-monoid is a cancellation monoid (A, ,1,0) in which (i) a|1 implies a = 1, (ii) a greatest common divisor (gcd) exists for every two non-zero elements, and (iii) a least common multiple (lcm) exists for every two non-zero elements having a common multiple. Unique GCDmonoids exist in abundance (e.g., (IN ∪ {∞},+, 0,∞) and (IN, ·, 1, 0) as well as the monoids mentioned in the previous paragraph). A sequential transducer (ST) is a tuple M = (Q, q0, F,Σ, δ,A, a0, μ) where (i) Q is a finite set, (ii) q0 ∈ Q, (iii) F ⊆ Q, (iv) Σ is an alphabet, (v) δ : Q×Σ −→ Q, (vi) A = (A, ,1,0) is a monoid, (vii) a0 ∈ A \ {0}, ? Financially supported by the German Research Foundation (DFG, GK 334/3) and (viii) μ : Q×Σ −→ A. For every q ∈ Q the mappings δ̂q : Σ∗ −→ Q and μ̂q : Σ∗ −→ A are recursively defined by (i) δ̂q(ε) = q and μ̂q(ε) = 1, and for every w ∈ Σ∗ and σ ∈ Σ by (ii) δ̂q(w·σ) = δ(δ̂q(w), σ) and μ̂q(w·σ) = μ̂q(w) μ(δ̂q(w), σ). Finally, the power series SM ∈ A〈〈Σ∗〉〉 recognized by M is then defined to be (SM , w) = a0 μ̂q0(w), if δ̂q0(w) ∈ F , otherwise 0. We call a power series S ∈ A〈〈Σ∗〉〉 sequential (with respect to A), if there exists a sequential transducer M such that S = SM . In the following, let A = (A, ,1,0) be a unique GCD-monoid, M = (Q, q0, F,Σ, δ,A, a0, μ) be a ST, and S ∈ A〈〈Σ∗〉〉. Moreover, we use g(w) = gcdu∈Σ∗, w·u∈supp(S)(S,w·u) for every w ∈ Σ∗. If (S,w) = g(w) for all w ∈ supp(S), then S is called directed. Definition 1. The ST M is normalized, if there exists ⊥ ∈ Q\(F∪{q0}) such that δ(⊥, σ) = ⊥ for every σ ∈ Σ and μ(q, σ) = 0 ⇐⇒ δ(q, σ) = ⊥ for every q ∈ Q. Definition 2. We define the Myhill-Nerode relation ≡S ⊆ Σ∗ × Σ∗ by w1 ≡S w2, iff there exist a1, a2 ∈ A \ {0} such that for every w ∈ Σ∗ w1·w ∈ supp(S) ⇐⇒ w2·w ∈ supp(S) and a−1 1 g(w1·w) = a −1 2 g(w2·w). Proposition 3. If S is directed and ≡S has finite index, then there exists a sequential transducer M with index(≡S) states such that SM = S. Proof. In the proof we write [w] and [Σ∗] instead of [w]≡S and [Σ ]≡S . Let M = (Q, q0, F,Σ, δ,A, a0, μ) where for every w ∈ Σ∗ and σ ∈ Σ (i) Q = [Σ∗], q0 = [ε], F = { [w] | w ∈ supp(S) }, (ii) δ([w], σ) = [w·σ], a0 = g(ε), and μ([w], σ) = g(w)−1 g(w·σ). Moreover, the constructed ST is minimal with respect to the number of states amongst all normalized deterministic ST computing S. Theorem 4. The following are equivalent. (i) S is directed and ≡S has finite index. (ii) S is sequential.
منابع مشابه
Characterisation of (Sub)sequential Rational Functions over a General Class Monoids
In this technical report we describe a general class of monoids for which (sub)sequential rational can be characterised in terms of a congruence relation in the flavour of Myhill-Nerode relation. The class of monoids that we consider can be described in terms of natural algebraic axioms, contains the free monoids, groups, the tropical monoid, and is closed under Cartesian. 1 Inroduction The pro...
متن کاملMyhill-Nerode Relation for Sequentiable Structures
Sequentiable structures are a subclass of monoids that generalise the free monoids and the monoid of non-negative real numbers with addition. In this paper we consider functions f : Σ∗ → M and define the Myhill-Nerode relation for these functions. We prove that a function of finite index, n, can be represented with a subsequential transducer with n states.
متن کاملLearning Sequential Tree-to-Word Transducers
We study the problem of learning sequential top-down tree-toword transducers (stws). First, we present a Myhill-Nerode characterization of the corresponding class of sequential tree-to-word transformations (ST W). Next, we investigate what learning of stws means, identify fundamental obstacles, and propose a learning model with abstain. Finally, we present a polynomial learning algorithm.
متن کاملA Myhill-Nerode Theorem beyond Trees and Forests via Finite Syntactic Categories Internal to Monoids
The paper introduces recognizable languages as inverse images of sets of arrows from finite categories internal to monoids. The first result is the Myhill-Nerode Theorem as a conservative extension of the classic result for tree languages. The second result shows that a language of planar acyclic circuit diagrams whose gates have non-empty lists of input and output ports is recognizable if, and...
متن کاملOn normalization and type checking for tree transducers
Tree transducers are an expressive formalism for reasoning about tree structured data. Practical applications range from XSLT-like document transformations to translations of natural languages. Important problems for transducers are to decide whether two transducers are equivalent, to construct normal forms, give semantic characterizations, and type checking, i.e., to check whether the produced...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004